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TCP Buffer Dynamics 


Gabriel ISTRATE! 


Abstract 


Associate to each sequence A of integers (intending to model packet 
IDs in a TCP/IP stream) a sequence of positive integers of the same 
length M(A). The 2’th entry of M(A) is the size (at time 7) of the 
smallest buffer needed to hold out-of-order packets, where space is 
accounted for unreceived packets as well. Call two sequences A, B 
equivalent (written A =p B) if M(A) = M(B). 

For a sequence of integers A define SUS(A) to be the shuffled-up- 
sequences reordering measure defined as the smallest possible number 
of classes in a partition of the original sequence into increasing subse- 
quences. We prove the following result: any two permutations A, B of 
the same length with SUS(A), SUS(B) < 3 such that A =rp B are 
identical. The result is no longer valid if we replace the upper bound 
3 by 4. 

We also consider a similar problem for permutations with repeats. 
In this case the uniqueness of the preimage is no longer true, but 
we obtain a characterization of all the preimages of a given sequence, 
which in particular allows us to count them in polynomial time. 

The results were motivated by explaining the behavior and engi- 
neering RESTORED, a receiver-oriented model of traffic we introduced 
and experimentally validated in earlier work. 
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1 Introduction 


The TCP protocol [17] is the fundamental protocol for computer communi- 
cations. TCP breaks the information into packets, and attempts to maintain 
a ordered packet sequence to be passed to the application layer. It accom- 
plishes this by buffering packets that arrive out-of-order. 

Work in the area of network traffic modeling has brought to attention 
the significant impact of packet reordering on the dynamics of this protocol 
(2, 3, 12]. This has stimulated research (mainly applied) on measuring 
and modeling reordering [14, 15], and on quantifying the impact of packet 
reordering on application performance. 

In this paper we study a combinatorial problem motivated by modeling 
packet reordering in large TCP traces: suppose that we map a sequence A 
of packet IDs into the sequence of integers M(A) representing the different 
sizes of the buffer space necessary to store the out-of-order packets; we 
assume that space in the buffer is reserved (and accounted) for unreceived 
out-of-order packets as well. What kind of additional information on the 
sequence A is needed to identify A, given M(A) ? 

The problem arose in the context of RESTORED [9], a method for 
receiver-oriented modeling and compression of large TCP traces. In an 
experimental paper [9] we showed that RESTORED is able to regenerate 
sequences similar to the original sequences with respect to several reorder- 
ing metrics. One metric for which this result is true was reorder density 
(RD) from [10, 15, 16]. We found the experimental result for RD para- 
doxical for the following reason: RESTORED guarantees that the regener- 
ated trace is (locally) similar to the original sequence for a precise notion 
called =rg-equivalent (rigorously explained below). On the other hand 
=prp equivalence does not uniquely determine the value of measure RD; 
thus reconstructed sequences have no special reason to have the same RD 
value as the original sequence: they could get any value compatible with 
FB equivalence. 

Though possible in principle, the scenario we outlined never happened 
in our experiments with metric RD in [9]. One could attribute this either 
to the particularities of our reconstruction method or to the existence of 
“extra structure” in real-life TCP traces that would somehow preclude in- 
consistency. 

The theoretical result in this paper (Theorem 1 below), together with 
an experimental observation we made in [7] (that over 99% of the traces 
we previously considered for benchmarking RESTORED have values of the 
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SUS reordering measure at most 3), makes this result somewhat less para- 
doxical: indeed Theorem 1 proves that all reordering patterns arising from 
permutations (i.e. sequences with no repeated or lost packets) satisfying 
condition SUS < 3 have at most one preimage with these properties; There- 
fore all reordering measures are consistent on them. 

On the other hand Theorem 2 below provides (as discussed in Section 
5.2 below) a way to design a better encoding for the unordered phase of RE- 
STORED (see below); the advantage (compared with the method described 
in [9]) is an easier method to randomly sample preimage sequences. 

To sum up: the two theoretical results in this paper (Theorems 1 and 2) 
serve to explain some paradoxical behavior of a software system we had 
previously designed for inference of large network traces and help to better 
engineer it. 


2 Preliminaries 


We first give a brief primer on the relevant aspect of the TCP protocol, 
RESTORED and the concepts used in the sequel. 


2.1 A Brief Introduction to Networking 


The TCP protocol [17] attempts to maintain an ordered stream of data 
bytes, identified by an integer called byte ID, that is effectively communi- 
cated through the network by breaking it down into packets. The ordering 
is maintained by buffering out-of-order packets. Buffer dynamics can be 
described in part using several parameters: 


1. The first parameter is NextByteExpected, and is the smallest index of 
a data byte that has still not been received by the receiver. 


2. A second, related, parameter is LastByteRead, the index of the last 
byte processed by the receiver-side application that communicates 
through the network via the TCP protocol. Throughout this paper 
we will make the simplifying assumption that data is read by the ap- 
plication as soon as it is ready. In other words NextByteExpected = 
LastByteRead+1. 


3. Another parameter is LastByteRcvd, the index of the last byte that 
has arrived at the receiver, awaiting processing. 
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4. RcvWindow, the size of the receiver window, is a parameter that is 
meant to provide the sender with an estimate of the available buffer 
space at the receiver. 


5. Finally, RcevBuffer is a implementation-dependent system constant, 
the size of the receiving buffer. 


The functioning of the TCP protocol ensures that these four parameters 
are related through the relation ([11] section 3.5): 


RevWindow = RevBuffer — (LastByteRcvd — LastByteRead). (1) 


The term in parentheses on the right-hand side is the actual size of the 
TCP receiver-buffer. The measurement takes into account space reserved 
(but not necessarily used) for all packets from the first expected to the 
last arrived. This is, of course, proportional to the buffer size measured in 
packets rather than bytes if all packets have the same size. 

TCP is receiver-driven: that is, the receiver attempts to maintain con- 
trol on the sender flow stream by directing the sender speed, and acknowl- 
edging the received packets. An acknowledgment (shortly, ACK) generally 
consists of the ID of the first packet that has not yet been received. Acknowl- 
edgment mechanisms vary across different implementations, and can entail 
delayed or selective acknowledgments, urgent retransmission requests, etc. 
From our standpoint, what is important is that we can associate a sequence 
of integer ACKs to every sequence of packet IDs. In this paper we will use 
the following simple mechanism: ACK; is the integer that would be sent if 
the receiver would immediately ACK every received packet. In other words 
ACK; is the smallest index of a packet unreceived at stage 7. 


Example 1 Consider the following hypothetical sequence of packet IDs: 
A=(4 3 2 1). Then the sequence of ACKs is ACK(A)=(1 1 1 5). 
Indeed, at the beginning the protocol is waiting for packet 1, and ACKing 
correspondingly. After packet 1 has been received all packets can be sent to 
the application layer, hence the first unreceived packet has index 5. 


2.2 A Primer on RESTORED 


RESTORED [9] is a Markovian model of large TCP traces that incorporates 
information on the dynamics of packet reordering. It can be used to pro- 
vide estimates of various measures of quality of service without making 
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these measurements online, or storing the entire sequence. Rather, it first 
“compresses” the trace into a small “sketch” that allows regeneration of a 
TCP trace with (hopefully) similar characteristics. If needed, we can then 
perform a large number of measurements on the regenerated trace. Thus, 
the way RESTORED is envisioned to work involves the following steps: 


1. “Learn” Markovian models (and their associated parameters) instead 
of storing large TCP traces. 


2. Use these models to reconstruct similar sequences. 


3. Use these sequences to estimate measures of quality of service for the 
original sequences. 


Of course, one could hope to further add features to the model, beyond 
the details provided by RESTORED by, for instance, finding efficient ways 
of comparing and clustering connections with similar traffic properties. We 
will not discuss these engineering aspects further, but concentrate on some 
mathematical aspects relevant to engineering RESTORED . 

For the purposes of the present paper, a connection is simply a sequence 
of integers (packet IDs). Suppose that the receiver observes the following 
(hypothetical) packet stream 


123 65 74 8 9 10 12 13 14 11. 


In this example packets with IDs 4,5,6,7,12,13,14 and 11 arrive out of 
order. One can, consequently, classify the received packets into two cate- 
gories: those that can be immediately passed to the application layer, and 
those that have to be temporarily stored before delivery. In the example, 
packets 5, 6, and 7 are temporarily buffered, and the buffer is only flushed 
when packet 4 is received. Similarly, packets 12, 18, and 14 are temporarily 
buffered, and the buffer is flushed when packet 11 arrives. We will call a 
packet that marks the end of a sequence of consecutively buffered packets 
a pivot packet. Packets that are immediately delivered to the application 
layer are also trivially pivots. In our example this is the case for packets 1, 
2, 3, 4, 8, 9, 10 and 11. 

The distinction we introduced effectively defines a partition of the 
stream of packet IDs into segments. A segment of packets is bounded by 
pivot packets. There are two possibilities for describing packet dynamics 


e packets arrive in order. The largest consecutive subsequence of packets 
for which this property holds will be represented in RESTORED as the 
occurrence of an ordered state. 
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Figure 1: Markovian model of packet dynamics in RESTORED. 


e there is reordering and buffering. Such a segment bounded by consec- 
utive pivot packets form an occurrence of an unordered state. 


Each occurrence of the ordered state is followed by one or more occur- 
rences of the unordered state. It was the contribution of [9] to show that 
one can use a Markov Chain with the two states (see Figure 1) to represent 
the dynamics of packet IDs of real traces. 

The sequence of packet IDs in the ordered state is trivial by definition: 
they arrive in order, starting with the first expected packet. For the un- 
ordered state we proposed in [9] to deal with packet IDs in the unordered 
states by encoding reordering patterns into “sketches” via a many-to-one 
mapping M defined on sequences of packet IDs. The mapping reported in 
[9] was the following: 


Definition 1 Let A = {Aj, Ag,...,An} be a sequence of packet IDs (con- 
sidered on the receiver side). First define: 


e H; is the highest ID in the sequence Aj, Ao,..., Aj. 


e L; is the largest ID of a packet among Aj, Ao,...,A; that can be up- 
loaded, zero if none of them can. Equivalently L; = ACK; — 1. 


Further, define the buffer sequence M(A) associated with sequence A as 
follows. M(A) = (M1,...,Mn) with 


Mes Hi De: (2) 


In other words, M is the size of the smallest buffer large enough to store 
all packets that arrive out-of-order, where the definition of size accounts for 
reserving space for unreceived packets with intermediate IDs as well. 

Two sequences of packet IDs P and Q are full buffer (FB) equivalent 
(written P =pp Q) if M(P) = M(Q). 


Example 2 Let A=(4 3 2 1). Then M(A)=(4 4 4 O). 
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The mapping M is many-to-one, but a preimage (when it exists) can 
be computed in polynomial time [8]. This was used in the regeneration 
algorithm, where first we use the Markov chain to sample a sequence of 
ordered and unordered states. In any occurrence of the ordered state we 
simply sample from the distribution of possible lengths of such sequences. 
On the other hand, in each occurrence of the unordered state we first sample 
a sketch S' from the distribution of such sketches and then reconstruct a 
preimage (via M) of S. 

Furthermore, mapping M defined this way provides a formal way to 
guarantee that the reconstructed sequence is locally “similar” to the original 
one. The formal notion of similarity has implication for the dynamics of the 
TCP protocol: 


Definition 2 Two packet sequences A, B are behaviorally equivalent if they 
yield the same sequence of AC Ks. 


Suppose now that a TCP implementation uses simple (as opposed to 
selective or cumulative) ACKs , and acknowledges every single packet. Then 
two traces that map (via M) to the same sequence are behaviorally equivalent 
[6]. As the dynamics of the congestion window is receiver-driven, assuming 
identical network conditions for the ACK sequences, such traces can be 
regarded as “equivalent,” from a receiver-oriented standpoint. 

We will also need a standard measure of disorder in almost-sorted se- 
quences [4]. This measure has the name shuffled up-sequences (SUS). and 
is defined as follows: 


Definition 3 Given a sequence of integers A denote by SUS(A) the mini- 
mum number of ascending subsequences into which we can partition A. 


For example, sequence 
A = (6,5, 8,7, 10, 9, 12, 11, 4, 3, 2) 


has SUS(A) = 5, a decomposition into a minimal number of increasing 
subsequences being 


A = (6,8, 10, 12) U (5,7, 9, 11) U (4) U (3) U (2). (3) 
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3 Main Result 


In this section we will prove our main result: 


Theorem 1 Let A, B be permutations of length n with SUS(A),SUS(B) < 
3 such that A=prp B. Then A= B. 


Observation 1 The theorem is no longer true if we replace the condition 
with SUS(A), SUS(B) < 4. This is witnessed by sequences (4 3 2 1) 
and (4 2 3 1). Indeed A=pp B, since they both map to (4 4 4 0). 
In fact SUS(A) = 4, SUS(B) = 3. 


Proof: 

We consider the algorithm SUSGreedy with the pseudocode given be- 
low. The algorithm is related to patience sorting [1] and has been implicitly 
shown to compute measure SUS in [13]; the reason is that SUS coincides 
[13] with another reordering measure denoted by LDS, defined as follows: 


Definition 4 Let A = (aj, a2,...,@n) be a sequence of nonnegative integers. 
LDS(A) is defined as the longest length of a decreasing subsequence ai, > 
Dig >... Ai; A<%4 <tg<... <4; <n} of A. 


But it is well-known [1] that patience sorting and, consequently, the 
algorithm SUSGreedy computes parameter LDS (to make the paper self- 
contained we will reprove this result below). 


Algorithm 3.1: SUSGREEDY(W) 


INPUT W = (w}, wo,..., Wn) a list of integers. 
let i=1,j=1 
let L, be the empty list 


attempt to add p; to the first list D:,t < J, 
where it can be added while keeping it sorted 
if no such list exists 

‘omar 
{ create new list L; = {pi} 
Laps 
let uw be the number of lists created by the algorithm 


while (i <n) 


return (u = LDS(W) = SUS(W)). 
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Algorithm 3.2: RECONSTRUCT(W) 


INPUT: list W = (wy, wa,...,Wn) of positive integers 
in the range 1...n 


# one cannot buffer a single packet: 
if some w; is 1 return NO PERMUTATION EXISTS 


let PACK ET and ACK be integer vectors of size n, 
with all fields — 1 
let ACK [0] = 1. Also let wo = 0. 


fori=lton 
if wi < wy_-1 


# buffer shrank 
# just arrived packet was the first unreceived 
PACKET{|i] = ACKii - 1] 
AC K/i] — ACK {i _ 1] + (ws. _ Wi) 
do 
else 


ACK(i] = ACK{i — 1; 
if (wi > Wi-1) 
then 
PACKET(i| = ACK; +u;-1 


do 


fori =1ton 
if (wi = Wit) 
let PACK ET{i| be the smallest positive 
integer not in {PACK ET[j]}; 


if (PACKET is a permutation of {1,...,n}) return PACKET 


else return NO PERMUTATION EXISTS 


We next consider a simple algorithm called RECONSTRUCT, its pseu- 
docode being given above, that, given a sequence W of positive integers con- 
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structs (if possible) a permutation A of the same size such that SUS(A) < 3 
and M(A) = W. The proof that the algorithm is correct will imply the 
uniqueness of sequence A. 

We prove the correctness of algorithm RECONSTRUCT in a couple 
of intermediate steps. The first two apply to a general sequence A (rather 
than one with SUS(A) < 3). 


Lemma 1 Suppose there exists a permutation 7 with M(m) = w. Then the 
following are true for any i> 1: 


1. The last element added to list L; is at least as large as any element 
present in some list Ly, k > 7. In particular the largest element of Ly 
is the maximum element seen so far. 


2. If element x is the largest element seen up to step i then x = ACK; + 
Wi 1. 


Proof: Let i= 1. Statement 1. is clearly true. For the second statement, 
note that ACK, = 2 and w; = 0 if x = 1 (in-order packet) otherwise 
ACk, = 1, tS es 

Consider now the case 7 > 1. By the induction statement, the largest 
element seen so far (call it y) is the last element of ZL; and y = ACK;_; + 
Wi-1 — 1. 

Case 1: x is added to Lj. By the definition x > y so = is the largest 
element seen so far. Moreover, since x is an out-of-order element we have 
ACK; = AC K;_1 and w; = wi-1+ 2 -y. 

Case 2: x is added to some other list L;. If x is the first element 
of the new list then statement 1 follows immediately. Otherwise let z be the 
largest element of list L; before adding x. Applying the induction hypothesis 
it follows that z is the largest element in lists L,, k > 7. But z < x (since we 
add x to list L;). Thus x becomes the new largest element of lists L;, k > j. 
As for the second statement, from the algorithm it follows that x < y so y is 
still the largest element seen so far. If the buffer size does not modify then 
the desired relation follows from y = AC Kj_1 + wj-1 — 1 (which holds by 
induction) and relations ACK; = AC Kj; and w; = w;-1. Otherwise the 
buffer shrinks by amount AC K;-— AC K;_1, so wjy_-1 -—w; = ACK; -ACKj_}. 
We infer the fact that 


y = ACKi-1+we-1—1= ACK; — (ACK; — ACKi-1) + wi-1 -—1= 
ACK; + (wi — wi-1) + wi-1 — 1 = ACK; + w; - 1. 
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thus proving Lemma 1. 


Corollary 1 Algorithm SUSGreedy correctly computes u = LDS(A) (which 
is equal to SUS(A) /13/). 


Proof: Let B= aj, > di, >... > Gizpg 4) be a decreasing subsequence 
of W of maximum length, and let Li, Lo,...,L; be the lists created by the 
algorithm on input sequence A. Each list Lx is increasing, so it contains 
at most one element from B. Therefore u > LDS(A). On the other hand, 
each element am set by the algorithm to a list Ly, k > 2 is smaller than 
some element ap, n < m, set by the algorithm to list Lx_1 (otherwise a,, 
would be set to a list Lj, 7 < k). Applying this observation starting with 
the last element of list D,, we create a decreasing sequence of length u. It 
follows that u < LDS(A), thus u = LDS(A). 

From now on we assume the fact that there exists a permutation A 
with SUS(A) < 3 such that M(A) = w. We will run the algorithm SUS- 
Greedy along algorithm RECONSTRUCT. First we give a simple corollary 
of Lemma 1: 


Corollary 2 Suppose that w; > w;j_1. Let y be the largest ID of a packet 
received in stages 1 toi —1 and x be the ID of the new packet. Then 
x=yt(w;— wii) and x is added by SUSGreedy to list Ly. 


Next we deal with another possible case, the one when the buffer size 
shrinks: 


Lemma 2 (a). Let packet ID x be added at stage i, and assume that 
wi < wj1. Then x = ACK;_; and all packets with indices at most 
ACK;j_-1 + (wj-1 — w; — 1) have been received in the first i stages. 


(b). Suppose packet ID x is added by algorithm SUSGreedy to list L3. Then 
packet x falls into case (a) of this lemma. 


Proof: 


(a). The fact that « = AC K;j_1 follows from the definition of parameter 
ACK and the fact that the buffer shrinks. The second relation follows 
from the fact that the buffer shrinks by exactly wj_1 — uj. 
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(b). Since x goes in list D3, at the time when it is added x is smaller than 
the last element in lists Ly; and Lo. If x were larger than AC'Kj_-1 
then the packet with index AC K;_; (which arrives sometimes after x 
does) could not be placed in lists Ly, Lz or L3, making the sequence 
A require SUS(A) > 4, a contradiction. 


The other two relations follow from the definition of parameter AC'K;. 


Lemma 3 In the conditions of Theorem 1 there exists at most one permu- 
tation t with M(r) = w, and it is the one found by algorithm RECON- 
STRUCT. 


Proof: 

This follows easily: if w; 4 w;_1 then by Corollary 2 and Lemma 2 the 
ID of the packet is uniquely determined. To prove the Lemma we have to 
show that this is true for the case w; = wj_; as well. We claim that if a 
packet ID z is set at stage 7 in the second FOR loop then it must correspond 
to adding x to list Lo. Indeed, if x were added to L; then it would be the 
largest element seen so far (hence we would have w; > wj_1). The fact that 
x cannot be added to Ls follows from Lemma. 2. 

Since list Lo is sorted, x is the smallest element that has not been 
set up to this stage. This constraint uniquely determines the value of 2, 
thus proving Lemma 3. 


This concludes the proof of Theorem 1. 


4 Extension to Permutations with Repeats 


In [8] we have also considered extending encodings of permutations with 
repeats, defined as follows: 


Definition 5 A permutation with repeats is a sequence of integers A = 
Aj,..-,;Am such that, 


1. for some n> 1, {Aj, Ao,..., Am} = {1,2,...,n} (as sets). 


2. For any repeat packet A;, we have A; > ACK;. 
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The second condition is a reasonable assumption from the standpoint of 
TCP behavior: namely, repeats of a packet that has already been uploaded 
to the application layer are removed from consideration. 

In that paper the encoding of permutations was not M, but a different 
map B. In the sequel we define a similar extension of map M: 


Definition 6 Let A be a permutation with repeats. Let A the subsequence 
of A obtained by eliminating all repeats. Define M as follows: M(A) is 


obtained by inserting —1’s into M(A) in positions corresponding to repeat 
packets. 
Also, for two packet sequences define A =z, B iff M(A) = M(B). 


Example 3 /f 
A=(4 5 62 4 1 3) 
then 
M(A)=(4 5 6 6 —1 4 O). 


Theorem 1 does not extend to the case of permutations with repeats: 


Example 4 Let A, B be the following two sequences. 
A=(4 5 6 2 4 1 3) 


and 
B= (4 65 6" 2 25> tS) 


Then A =p B and SUS(A),SUS(B) = 3. 


However, as we show in the proof of the following result we can easily 
obtain a compact representation of all sequences in the preimage of a given 
string. This representation allows us to count the number of such sequences 
in polynomial time: 


Theorem 2 There exists an algorithm that, given a sequence P = P,... Py 
of positive integers, counts the number U of permutations with repeats A = 
A, Ag,...,An such that A € M1 '(P) and SUS(A) < 3 in time polynomial 
in the sizes of P and U. 
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Proof: We will attempt to give a characterization of sequences A with the 
properties listed in the above Theorem that will enable us to count them 
efficiently. 

First, note that the conclusions of Lemmas 1 and 2 are still true for 
permutations with repetitions as well, since in their proofs we never used 
the fact that we are getting distinct elements. 


Lemma 4 No repeat packet is added to list Ls. 


Proof: This is a simple consequence of the fact that every packet added 
to list L3 is (by Lemma 1) an ACK packet. Once received, such an ACK 
packet is no longer considered (by the convention we made in Definition 5) 
among repeats. 


Lemma 5 Consider a permutation with repeats A with the properties in 
Theorem 2 and its subsequence A obtained by removing all repeats. Then 
packets of A that belong to A are set when running SUSGreedy(A) on the 


same list as when running SUSGreedy(A). 


Proof: For lists Ly and L3 this follows directly from Lemma 1 and 
Lemma 2 for permutations with repetitions. 

On the other hand, a packet added by SU.S(Greedy(A) to Lz could 
only be added by SUSGreedy(A) to Lz or L3, because, by Lemma 1 packets 
added to Ly, are the largest seen so far, which the given packet is not. 

It cannot, however, be added by SUSGreedy(A) to L3 because, by 
Lemma 2 for permutations with repetitions, it would have to be an ACK 
packet; however we assumed it’s a repeat. 

With Lemma 5 settled, proving Theorem 2 is easy: first we eliminate 
the —1’s from sequence W and apply Theorem 1 to recover the unique 
permutation in the preimage. That allows us to obtain the values A; for 
all indices i such that W; # —1. In particular let Iz be the (ordered) set 
of indices 7 for which packet A; goes to list Lz. We can also simulate the 
buffering process and determine, for any such 7 the set B; of packets buffered 
at moment 2, excluding the largest such packet. 

It remains to determine the possible values A; for those positions cor- 
responding to repeats (i.e. W; = —1). Such a packet can either be: 


1. A repeat of the largest packet on the list L, at time 7, or 


2. A repeat of a packet that is buffered at stage 7 that will be added by 
SUSGreedy(A) on list Lo. 
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We will call indices 7 positions of type 1 and of type 2, based on which 
of the two cases applies. 

Repeats of the first type do not create any additional constraint for 
subsequent packets, as they don’t modify the value of the largest packet on 
list £1, or other parameters (ACK, etc). Also the value of such a packet 
is uniquely determined: it’s simply a copy of the last element on list Ly 
(if P; = —1 simply return 0, as the sequence cannot start with a repeat 
packet). 

As for repeats of the second type, there are two constraints they have 
to satisfy: 


1. First, any such packet has to be a repeat of a copy of a packet that is 
buffered at the moment it is added onto Lz, different from the largest 
buffered packet (or else it would be added to L1). 


2. Their addition to Lz must result in the maintenance of an increasing 
order on this list. 


The first condition simply states that A; € B;. The second condition 
requires some care. Let 1 <i <n and let 7; < i < ig be the consecutive 
moments in [2 when non-repeat packets are added to Ly. Then the value 
A; of the repeat packet must correspond to a value Aj, < A; < Aj,. One 
of 71,72 might fail to exist, in which case we replace the missing limit in the 
previous inequality by 0 (in the case of the lower value) respectively +00. 

Define, for each i with P; = —1, 


C; = BEN (Ai, , Ais) 
Thus, to specify a sequence A as in Theorem 2 we have to specify: 
e (i): which of the indices 7 with P; = —1 are of the second type. 


e (ii): an integer A; € C; for any such i such that the resulting sequence 
is monotonically nondecreasing. 


The conclusion of this argument is that permutations with repeats A 
in the preimage of W are in bijective correspondence with nondecreasing 
partial functions f defined on a subset of indices i € {j|W; = —1} with the 
restriction that for any such i such that f (2) is defined f(t) € C;. 

We can count such sequences in polynomial time by dynamic program- 
ming as follows: 
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1. we start with an empty table and go in increasing order of indices 7. 
2. We count partial functions by their largest value. 


3. Given a possible value A; in the list C;, its count is one plus the sum 
of all the counts of sequences ending in values of the table less or equal 
to A; taken from previous stages. 


4. The output value is one (corresponding to the empty sequence) plus 
the sum of all the counts of elements in the table. 


Example 5 We illustrate the above dynamic programming algorithm on 
three sets (without necessarily corresponding to packet sequences), A, = 
{1,2}, Az = {2,3}, Az = {3,4}. The table corresponding to counting the 
partial functions is displayed in Figure 2. 


Stage index |} count 

1 1 
1 2 1 
2 2 3 
2 3 3 
3 3 9 

4 9 
TOTAL - 27 


Figure 2: The dynamic programming counting algorithm on data from Ex- 
ample 6 


It is clear that the dynamic programming runs in polynomial time, thus 
completing the proof of Theorem 2. 


5 Applications to RESTORED 


5.1 Explaining RESTORED Behavior 


The result we just proved allows the reinterpretation of results in [7, 9]. In 
that paper it was shown experimentally that RESTORED is able to recover 
several measures of quality of service, among them the following metric [10]. 
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For ease of presentation, our version of the metric is presented in the case 
of permutations (i.e. sequences with no repeats or packet losses). See [10] 
for the general definition: 


Definition 7 Reorder Density (RD). 
Consider an implementation-dependent parameter DT that is a positive 
integer or co. Given a permutation 7 we define the reorder density of 7 as 


the distribution of displacements i — [i], restricted to those displacements 
in the range [—DT, DT]. 


In general it is simply unreasonable to expect to perfectly recover an 
arbitrary measure W of reordering. The reason is that RESTORED replaces 
an input sequence A with a sequence B from the preimage F~'(F(A)). 
unless W is mostly constant on set F~'(#(A)) one cannot expect that the 
two values of W for the original and reconstructed sequence will be identical. 

The above discussion motivated the following definition from [7] (stated 
here for simplicity for =,g): 


Definition 8 A metric M is consistent with respect to =prp if for any two 
ID sequences A and B, A=pp B = > M(A) = M(B). In other words, a 
consistent measure M takes equal values on equivalent sequences. 


Example 6 By equation (1), every measure defined in terms of the time 
series of parameter RcwWindow (e.g. the average value of this parameter) 
is consistent with respect to =rp. 


In particular, since RESTORED (in the form used in [7, 9]) guarantees 
that, on sequence A it will reconstruct a sequence R(A) such that R(A) =rp 
A, it is not really that surprising that RESTORED should be able to capture 
any metric consistent with respect to =rp. 

The reason that we found the experimental results from [9] somewhat 
paradoxical is that RD is an example of an inconsistent measure (at least 
on “theoretical” examples) according to the terminology of Definition 8. 


Observation 2 If A = (4321) and B = (4 2 31) then the distributions 


of displacements are D(A) = ( Yr o ) and D(B) = 


( 4 : : i respectively. It is easy to see that, no matter how we 


1/4 1/2 1/4 
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set the parameter DT to either a positive integer or oo, the truncated ver- 
sions of distributions D(A), D(B) are going to be different. Thus A=rp (B) 
but D(A) 4 D(B), which means that measure RD is inconsistent indepen- 
dently of the value of threshold parameter DT. 


However, Theorem 1 forces us to reevaluate the real-life applicability of 
this simple example: since the vast majority of traces used in the experimen- 
tal benchmarking from [9] had SUS < 3 the “theoretical” inconsistency of 
RD displayed by the example in Observation 2 is less stringent “in practice”: 
in particular since no two permutations map to the same reordering pattern 
and, crucially, repeat packets are discarded before computing RD [10], it is 
no longer that surprising that RD could be largely consistent “in real life”, 
even though theoretically inconsistent. 


5.2 Redesigning RESTORED 


As hinted in [9] (and witnessed by our subsequent papers [7],{8]) mapping 
M was by no means the only many-to-one mapping we tried in developing 
RESTORED. Indeed, in these subsequent papers we mathematically ana- 
lyzed a version of the function F'B, that only takes into account effectively 
buffered packets in the definition of buffer size, and showed that it has good 
properties. 

More generally, the conditions we want any good compression methods 
for reordering patterns F to satisfy are the following: 


1. Mapping ¥ has to be easily computable. 


2. Mapping F should have many preimages of a given coded sequence, 
especially those arising from “real-life” observations. 


3. Mapping F has to be invertible in polynomial time: there has to be a 
polynomial time algorithm that, given a coded sequence X computes 
a preimage A € F~!(X). 


4. A stronger form of condition 3. requires to be able to sample in 
polynomial time (almost) uniformly from the set of preimages A € 
F-1(X) of a coded sequence X. 


The first condition ensures good compression. The second one ensures 
computational tractability. Condition 4 (and its weaker version 3) is used 
in the regeneration phase of RESTORED . 
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In [6] we have shown that mapping FB satisfies condition 3. Stronger 
condition 4 is also true (but this was not explicitly stated /shown in that 
paper there), since sampling matchings in a bipartite graph has a fully 
polynomial randomized approximation scheme. The sampling algorithm is 
too complicated to be of practical use, and was never implemented as part 
of RESTORED . 

For mapping FB (that somewhat differs from FB on sequences with 
repeat packets), the following simple Corollary of Theorem 2 shows that the 
situation is much better: a simple sampling algorithm can be obtained by 
adapting the algorithm used to count preimages: 


Corollary 3 There exists an algorithm that, given a sequence of integers P 
generates a random permutation with repeats A € M '(P) and SUS(A) < 3. 


Proof: In the proof of Theorem 2 we counted permutations with repeats 
with the desired properties by mapping them bijectively to nondecreasing 
partial functions with a special structure. 

The problem of counting such nondecreasing partial functions is easily 
seen to be self-reducible: if 7 is the smallest index with P,;, such a partial 
function f is in one of the following situations: 


e f is undefined at 7, and in bijective correspondence with a partial 
function on the (smaller) domain obtained by eliminating 7. We can 
use the previous algorithm to compute Py, the number of such partial 
functions. 


e f assumes a value A € C;. In this case the restriction of f to the 
domain obtained by eliminating 7 is a similar object (a nondecreasing 
partial function with a similar special structure). For every \ we can 
use the previous algorithm to compute P), the number of such partial 
functions. 


Now it’s easy to generate a random nondecreasing partial function f: 
: : F aq Po 
1. f will be undefined at i with probability PoF3-x Px’ 
. . aye P 
2. f will assume a value \ € C; with probability Poty Pr’ 


3. Quantities Py, P, can be efficiently computed using the dynamic pro- 
gramming algorithm from the proof of Theorem 2. 
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In conclusion, using mapping F'B is preferred to mapping F'B for ease 
of sampling reasons. This was not clear at the time we did the experiments?, 
but is now in retrospect. 


6 Conclusion and Acknowledgments 


In this paper we proved two related combinatorial results, Theorems 1 and 2, 
inspired by our earlier experimental work [9]. The first result has been shown 
to illuminate some experimental behavior in [9]; the second one can be used 
to improve the regeneration method from that paper. 

An interesting research question, suggested by an anonymous referee, 
in a more theoretical direction and is as follows: parameters LDS/SUS are 
intimately tied to the combinatorics of Young tableaux in semi-standard 
form [5]. The Schensted correspondence, essentially a recursive version of 
patience sorting, associates to every permutation a pair of Young tableaux 
of the same shape. It is tempting to wonder if congruences such as the be- 
havioral equivalence introduced in this paper and reconstruction algorithms 
such as the one presented here can be interpreted in light of this theory. 

I thank Anders Hansson for useful discussions, and an anonymous ref- 
eree for a careful reading and many useful suggestions. 

This work has been supported in part by CNCS IDEI Grant PN-IL-ID- 
PCE-2011-3-0981 ”Structure and computational difficulty in combinatorial 
optimization: an interdisciplinary approach”. 


References 


[1] D. Aldous and P. Diaconis. Longest increasing subsequences: from 
patience sorting to the Baik-Deift-Johansson theorem. Bulletin 
of the American Mathematical Society 36, pages 413-432, 1999. 
doi:10.1090/S0273-0979-99-00796-X. 


[2] J. Bennett, C. Partridge, and N. Shectman. Packet reordering is not 
pathological network behavior. IEEE/ACM Transactions on Network- 
ing, 7(6):789-798, 1999. doi:10.1109/90.811445. 


In fact, we experimented with both FB and FB, but the experimental results were 
very similar, so we saw no reason to present both versions 


Identifying Almost Sorted Permutations from 
TCP Buffer Dynamics 153 


[3] 


10 


11 


12 


J. Bellardo and S. Savage. Measuring packet reordering. In Proceedings 
of the 2nd ACM Workshop on Internet Measurement, pages 97-105, 
2002. doi:10.1145/637201 .637216. 


V. Estivill-Castro and D. Wood. A survey of adaptive sort- 
ing algorithms. ACM Computing Surveys, 24(4):441-476, 1992. 
doi:10.1145/146370. 146381. 


W. Fulton. Young tableaux: with applications to representation theory 
and geometry. Cambridge University Press, 1997. 


A. Hansson, G. Istrate, and S. Kasiviswanathan. Combinatorics of 
TCP reordering. Journal of Combinatorial Optimization, 12(1-2):57-— 
70, 2006. doi: 10.1007/s10878-006-8904-0. 


A. Hansson, G. Istrate, and G. Yan. Packet reordering metrics: Some 
methodological considerations. In Proceedings of the Second Interna- 
tional Conference on Networking and Services (ICNS’06). IEEE Com- 
puter Society Press, 2006. doi:10.1109/ICNS.2006.80. 


G. Istrate and A. Hansson. Counting preimages of TCP reorder- 
ing patterns. Discrete Applied Mathematics, 156(17):3187-3193, 2008. 
doi:10.1016/j.dam.2008.05.011. 


G. Istrate, A. Hansson, S. Thulasidasan, M. Marathe, and C. Bar- 
rett. Semantic compression of TCP traces. In Proceedings 
of the IFIP NETWORKING Conference, volume 3976 of Lecture 
Notes in Computer Science, pages 123-135. Springer Verlag, 2006. 
doi:10.1007/11753810_11. 


A. P. Jayasumana, N. M. Piratla, A. A. Bare, T. Banka, and R. Whit- 
ner. Improved packet reordering metrics. RFC 5236, 2008. 


J. Kurose and K. Ross. Computer Networking: A Top-Down Approach 
Featuring the Internet, Second Edition. Addison Wesley, 2003. 


M. Laor and L. Gendel. The effect of packet reordering in a backbone 
link on application throughput. IEEE Network 16(5), pp. 28-36, 2002. 
doi:10.1109/MNET.2002.1035115. 


C. Levcopoulos and O. Petersson. Sorting shuffled monotone 
sequences. Information and Computation, 112(1):37-50, 1994. 
doi:10.1006/inco.1994.1050. 


154 G. Istrate 


[14] A. Morton, L. Ciavattone, G. Ramachandran, S. Shalunov, and 
J. Perser. Packet reordering metric for IPPM. IETF RFC 4737, 2006. 


[15] N. M. Piratla, A. P. Jayasumana, and A. A. Bare. Reorder Density 
(RD): A formal, comprehensive metric for packet reordering. In Pro- 
ceedings of the IFIP Networking Conference 2005, volume 3462 of Lec- 
ture Notes in Computer Science, pages 78-89. Springer Verlag, 2005. 
doi:10.1007/11422778_7. 


[16 


= 


N. Piratla, A. Jayasumana, A. Bare, and T. Banka. Reorder buffer- 
occupancy density and its applications for measurement and evalua- 
tion of packet reordering. Computer Communications, 30(9):1980-1993, 
2007. doi:10.1016/j.comcom.2007.03.001. 


[17] W.R. Stevens. TCP/IP Illustrated. Addison Wesley, 1994. 


© Scientific Annals of Computer Science 2015 


